Time evolution of epidemic disease on finite and infinite networks 
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Mathematical models of infectious diseases, which are in principle analytically tractable, use two 
general approaches. The first approach, generally known as compartmental modeling, addresses 
the time evolution of disease propagation at the expense of simplifying the pattern of transmission. 
The second approach uses network theory to incorporate detailed information pertaining to the 
underlying contact structure among individuals while disregarding the progression of time during 
outbreaks. So far, the only alternative that enables the integration of both aspects of disease prop- 
agation simultaneously while preserving the variety of outcomes has been to abandon the analytical 
approach and rely on computer simulations. We offer a new analytical framework, which incorpo- 
rates both the complexity of contact network structure and the time progression of disease spread. 
Furthermore, we demonstrate that this framework is equally effective on finite- and "infinite" -size 
networks. This formalism can be equally applied to similar percolation phenomena on networks in 
other areas of science and technology. 

PACS numbers: 89.75.Hc, 87.23.Ge, 05.70.Fh, 64.60.Ak 



I. INTRODUCTION 

The spread of communicable diseases is a dynamical 
process and as such, understanding and controlling in- 
fectious disease outbreaks and epidemics is pertinent to 
the temporal evolution of disease propagation. Histori- 
cally, this aspect of disease transmission has been studied 
using coarse-grained dynamical representation of popu- 
lations, known as compartmental models [TJ [21 03 HI [5] . 
In these models, a population is divided into a number of 
epidemiological states (or classes) and the time evolution 
of each is described by a differential equation. 

Although this approach, and its more complex vari- 
ants, has been instrumental in understanding several 
features of infectious diseases over the past 3 decades, 
it comes with a major simplification. The simplifying 
assumption states that the population is "well mixed", 
i.e., every infectious individual has an equal opportunity 
to infect others. This assumption may be valid in the 
broader context of population biology. Human popula- 
tions, however, tend to contact each other in a hetero- 
geneous manner based on one's age, profession, socio- 
economic status or behavior, and thus, the well-mixed 
approximation cannot portray an accurate image of dis- 
ease spread among humans [6]. 

Recent advances in network- and percolation-theories, 
have paved the way for physicists to bring a new per- 
spective to understanding disease spread. Over the past 
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decade, seminal works by Watts and Strogatz on small- 
world networks [7J [8] , Barabasi et al. on scale- free net- 
works [9] and Dorogovtsev, Mendes [10], Pastor-Satorras 
and Vespignani [XT] , among others, on the dynamics of 
networks, have shed light on a number of intriguing as- 
pects of epidemiological processes. In particular, ground- 
breaking work by Newman et al. [12J[T3][T4] has provided 
a strong foundation for the formulation of epidemiologi- 
cal problems using tools developed by physicists. 

Various dynamical processes that propagate from 
neighbour to neighbour on complex (natural or artifi- 
cial) networks, e.g., a crawler (or spider) browsing the 
World Wide Web or rumors spreading in a population, re- 
veals interesting similarities with the spread of epidemics 
in human population p~5j [16]. In the present work, we 
specifically focus on disease propagation as the dynami- 
cal phenomenon and use the associated terminology. Our 
methodology is however quite general and can be applied 
mutatis mutandis to other processes that manifest simi- 
lar dynamical properties. 

We are primarily interested in diseases where infected 
individuals are eventually removed from the dynamics 
of the system (i.e., infection is followed by naturally- 
acquired immunity or death), implying that the same 
person cannot be infected more than once. At any given 
time, we call an individual "susceptible" if he has never 
been exposed to the disease; "exposed" if he has acquired 
the infection but not currently able to pass on the disease 
to another person; "infectious" if he is currently able to 
transmit the infection to others; "removed" if he became 
immune or succumbed to death after acquiring the infec- 
tion; and finally, "infected" if he has been exposed to the 
infectious agent at least once in the past, regardless of 
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his current state (e.g. exposed, infectious, removed). 

Network analysis using the generating function formal- 
ism, developed by Newman et al., has proven to be a 
powerful tool when analyzing the spread of disease within 
networks [13]. Without directly addressing the question 
of "when the transmission occurred?", it provides reli- 
able results on the final size of an outbreak/epidemic 
by addressing the question of "whether transmission oc- 
curred?". The first question, i.e. the time evolution of 
the system, is presently beyond the formalism as origi- 
nally derived. 

Several researchers have recognized the importance of 
incorporating the notion of time into the generating func- 
tion formalism that describes percolation dynamics on 
networks. In broaching this issue, many notable advances 
have been made. Numerous contributions (for a recent 
review, see [17]) have used an approach closely related 
to compartmental models to assign a higher strength of 
infection to nodes of higher degree p~8j [19j [20] . More re- 
cently, Marder [21] has calculated the probability distri- 
bution of outbreak sizes as a function of time for infinite- 
size networks, while Volz [22] has addressed the finite-size 
effect by deriving estimates of the mean size of a large- 
scale epidemic, rather than the probability distribution. 
Despite these advances, one is required to develop a truly 
integrated analytical framework that simultaneously en- 
compasses the time progression of disease, the network 
finite-size effects and the wide variety of possible out- 
comes. The major steps towards an integrative formalism 
are the principal achievement of this paper. 

The outline of the paper is as follows. In Sec. [TT] we 
define the type of dynamics that will be studied and re- 
call some of the tools used by Newman [13] for infinite 
networks, to which we add the concept of generations 
and phase-space representation. In Sec. Ill we extend 
these tools for uses on finite-size networks and Sec. [IV] 
presents some results of our analysis, compares them with 
some existing models, and discuss possible extensions of 
the formalism. In Sec. [V] we give our conclusions and 
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II. FORMALISM FOR INFINITE NETWORKS 

We map a system of N individuals to a network in 
which each individual is represented by a node (or ver- 
tex) and the connection between each pair of individuals 
is represented by a link (or edge). Two nodes are neigh- 
bours if they are joined by a link. Contrary to compart- 
mental approaches, a network representation of a sys- 
tem takes into account that each node does not have the 
same probability of interacting with every other node; in 
fact, one interacts only with its topological neighbours. 
Each node has a degree k{ (number of neighbours) and 
the set {hi} (called degree sequence) partially defines the 
network. Similarly, the set of probabilities {pk} that a 
random node has degree k is the degree distribution. 

In many practical situations, the degree distribution is, 



together with the size N of the network, the only avail- 
able information on the network structure. We consider 
the ensemble of all possible networks obtained by drawing 
a degree sequence from the provided degree distribution 
and then, for every node i G {1,2,..., TV}, randomly con- 
nect each of its k{ links to those of other nodes (no self- 
loops) [13] . The quantities obtained through this paper 
are averages on this ensemble. We add to this maximum 
entropy definition the restriction that two nodes cannot 
share more than one link (no repeated links). For sparse 
graphs — in which the number of links scales linearly 
with the number of nodes — the probability for such an 
event decreases as 1/N and can be neglected for large 
networks [T2] . 

To perform Monte-Carlo simulations of epidemic prop- 
agations on a network, one requires an explicit knowledge 
of that network structure. We have used the following 
method [13] to produce a network belonging to the en- 
semble described above: 

i. generate a random degree sequence {hi} of length 
N subjected to the degree distribution {pk}] 

ii. make sure that ki is an even number since a link 
is composed of 2 "stubs" ; 

hi. for each z, produce a node with ki stubs; 

iv. randomly choose a pair of unconnected stubs and 
connect them together. Repeat until all uncon- 
nected stubs are exhausted; 

v. test for the presence of self-loops and repeated 
links. Remove the faulty stubs by randomly choos- 
ing a pair of connected stubs and rewire them to 
the former stubs. Repeat until no self- loop and/or 
repeated links are found. 



A. Basic generating functions 

With knowledge of the degree sequence of the physical 
or social network of interest, we can obtain the corre- 
sponding degree distribution {pk}- Following Newman 
et al. [12] and Newman [13], we define the probability 
generating function (pgf) [23] for a random node as 



Go(x) = ^p k x k , 



(1) 



respecting the normalization Go(l) = YlkLoPk — 1- The 
average degree, zi, can be easily obtained from 



(k) = J2k Pk = G f (l) 



(2) 



k=0 



where the prime denotes the derivative with respect to 
the argument. The probability, that k nodes could 
be reached from the node we arrived at by following a 
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random link (excluding this link from the count), can 
also be derived as 



Gi(x) = £ 



qkx 



k=0 



G' (x) 



(3) 



While Go (x) and G\ (x) contain information about the 
structure of the physical network linking nodes within 
the epidemiological system, they do not contain any in- 
formation about the risk of disease transmission between 
two neighbouring nodes. However, with the additional 
knowledge of the transmissibility, T — the probability 
that an infectious node will infect one of its neighbors — 
we can determine the probability of infecting I out of k 
neighbors as 



Pl\k 



(^JT 1 (1-T) k - 



The pgf for the number of infections directly caused by 
the initially infected node ("patient zero") is then 
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G (l + (x- 1)T) 



Pk \ { )T L {l-T) k - L x l 



(5) 



Similarly, the probability distribution for the number of 
infections directly caused by a node reached by following 
a random link is generated by G\(l + (x — 1)T). 

We can continue in the same vein and obtain informa- 
tive results about the final state of the population in an 
infinite network, after the outbreak/epidemic has ended 
[6l [13]; however, this approach in itself does not yield 
any information about the duration of the epidemic, its 
speed of propagation or other time-related quantities. 



B. Generations 



As in Eq. (J5|, the pgf for the number of nodes that 
acquire infection directly from a single node of generation 
g is given by G g (1 + (x — 1)T). This assumption holds 
when the total number of infected nodes in the current 
and previous generations is small compared to the size, 
TV, of the network. In such a case, the probability of 
infecting a node that is already infected is proportional to 
1/iV; we assume that a node cannot be infected twice and 
the propagation of the disease follows a tree-like structure 
(without a closed loop). This condition is fulfilled in 
large networks either when there is no giant component 
or when we limit ourselves to the first few generations. 
Section [TTT1 removes these limitations to some extent. 

From the properties of the pgfs, the expected num- 
ber of secondary infections caused directly by an infected 
node in generation g is given by 



(4) <y = 



dG g (1 + (x - 1)T) 



dx 



TG' (1) (g = 0) 
rGi(l) (<?>!) 



(7) 



Notice that (l g ) is identical for every generation, except 
the first one. It corresponds to a fundamental quantity 
in epidemiology, the basic reproductive number [TJ |3] 



Ro = (h) = (h) 



TG' 1 (l)=T 



Z2 



(8) 



where Z2 = Gq(1) is the expected number of second 
neighbors for a randomly chosen node. Ro < 1 implies 
that the expected number of infectious nodes decreases in 
consecutive generations, leading to the extinction of the 
disease. Conversely, Ro > 1 implies that the expected 
number of infectious nodes increases and can potentially 
lead to an epidemic, which is a giant component of occu- 
pied links in the language of percolation theory [24 . It is 
worth noting that Ro > 1 alone does not guarantee the 
occurrence of an epidemic; indeed, some realizations may 
have a number of new infections below the mean value 
i?o? and therefore lead to the extinction of the disease. 



To study the progression of the outbreak/epidemic 
over the network, we adopt an approach based on gener- 
ations of infection. We define generation as the initial 
infected node of the outbreak/epidemic; nodes of gener- 
ation g are those who acquire the disease from a member 
of generation g — 1. 

There is clearly a causality link among generations 
and we would expect nodes of higher generations to be- 
come infected, on average, later than those in generations 
closer to the initial infected node. In a future contri- 
bution, we will look more closely into the relationship 
between generations and continuous time evolution. 

We now extend the generating function formalism to 
introduce a new pgf for an arbitrary generation g 



G 9 {X) 



G (x) 
G^x) 



(9 = 0) 
07 >1) 



(6) 



C. Phase-space representation 

To proceed further, we define the quantity ^f m as the 
probability of having s infected nodes by the end of the 
g-th generation, of which m became infected during the 
#-th generation. This probability is generated by 



n(x, y ) = j2^ xS y r ' 



(9) 



Each element of the (triangular) matrix, ^fm? can be 
seen as a possible "state of infection" where the s and 
m dimensions provide information about the "position" 
(number of infected) and "momentum" (new infections) 
in the infection space, respectively. 

We know from the previous sections that the proba- 
bility distribution for the number of nodes that acquire 
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infection directly from a single node of generation g — 1 
is generated by G g -\ (1 + (x — 1)T). Moreover, the pgf 
for the sum of two independent quantities is given by the 
product of their pgf's [T2j [23] . Therefore, the probability 
P(m\s f ,777/) that each state (s f ,m f ) of generation g — 1 
leads to m new infections in generation g is generated by 



E P(mW, m')x m = [G g -! (l + (x- l)T)] r 



(10) 



Also, the state (s* \m f ) has probability V^m 7 a t genera- 
tion g—\ and thus makes a contribution ^7^, P (rn\s' ', ra') 
to the state (5 = s' + m, m) at generation g. Hence, we 
obtain the recurrence relation 



E [^-1 ( J + (*» - wp' = E^ 



a.* y "> 



(11) 



with the initial condition ^!? m = <5 s i5 m i ((% is the Kro- 
necker delta) . The states for which m! = are absorbing 
states; the region above the main diagonal (m r > s') is 
forbidden; and the main diagonal (m f = s f ) is only ac- 
cessible for the initial condition. 

Finally, inserting Eq. (11) into Eq. ([9| provides the 
(forward) recurrence relation (for g > 1) 

9 a {x,y) = E ^' [G fl -i (1 + (*!/ - l)T)] m ' 



(12) 



with the initial condition ^(x, 2/) = xy or ^ = 5 s i5, 



ml 5 



where 5^- is the Kronecker delta. Equation (12) implies 
further the recurrence 

n(x,y) = ^ 9 - 1 (x,G g - 1 (l + (xy-l)T)) . (13) 

Note that ^q(x, 1) generates the probabilities 



are obtained by creating an ensemble of 10 3 equivalent 
graphs, each of which was used to run 10 5 simulations, 
performing 10 8 epidemic simulations in total [32] . 

Figure [2] clearly demonstrates that apart from the 
small-scale outbreaks, the results from the infinite- 
size formalism may not correctly predict the out- 
break/epidemic size distribution for finite-size networks 
when the fraction of the network that has been infected 
is no longer negligible. The remedy to this shortcoming 
is offered in the next section. 



III. FORMALISM FOR FINITE NETWORKS 

As long as one is only interested in the initial stage of 
an outbreak, the finite-size of a network has negligible 
effects on the dynamics of disease spread. However, the 
impact of finite-size effects becomes important when a 
sizable fraction of the network has been affected. While 
the size of small outbreaks is mostly governed by stochas- 
tic fluctuations, the size of the giant component (when 
one exists) is limited by two principal finite-size effects: 
the evolution over time of the degree distribution of sus- 
ceptible nodes and the failure of transmission due to the 
impossibility of re-infection. 

Since Eq. (12) is exact in the infinite limit, we can 



search for a similar form where the finite-size effects are 
introduced as a dependency in s' and m! of the degree 
distribution and/or of its parameters. In the following, 
we describe these effects and how they are introduced 
into the formalism described in the previous section. 

It is worth noting that the finite-size effects consid- 
ered here, affecting directly the dynamics on the network, 
should be distinguished from those that alter the struc- 
ture of the network, for instance through a cutoff in the 
degree distribution as done in [25 . 
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sm 



(14) 



that s nodes are infected at generation g, independent of 
the number of new infections. Similarly, ^q(1,t/) gener- 
ates the probabilities 
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sm 



(15) 



that m nodes are infected during generation g. 

Figure [T] illustrates some results of this method for a 
network of N = l 000 nodes with a power law distribu- 
tion pk ex k~ T e~ k l K with r = 2, n = 5 and the transmis- 
sibility T = 0.8. The phase-space representation for gen- 
erations 2, 6 and 11 as well as the final state are shown. 
Figure [2] provides the corresponding projection p s on the 
s axis (solid curves). The numerical results (crosses), 
theoretical "infinite-size" outbreak distribution (dashed 
curves) and theoretical "infinite-size" epidemic size (ver- 
tical dashed lines) are also displayed. Numerical results 



A. Evolution of the degree distribution of 
susceptibles 

As the disease progresses across the network, suscepti- 
ble nodes with a higher degree of connectivity are more 
likely to acquire the disease than those with fewer con- 
nections. If we focus only on the degree distribution of 
susceptible cases, the distribution will vary over time; 
the portion representing high-degree susceptibles will de- 
crease and the segment representing low-degree suscepti- 
bles will increase, to comply with normalization require- 
ments. This variability over time has a direct effect on 
the ratio Z2/21 and can lower the reproduction number, 
i?o? below the threshold value of 1. These effects can 
potentially cause the extinction of the disease although 
a high number of susceptible nodes is still remaining. 
This is particularly important for degree distributions in 
which some nodes have a degree much higher than the 
mean degree distribution (e.g., power-law distribution): 
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FIG. 1: (Color online) Representation of (s,m) phase space for the infinite-size network algorithm: s denotes the number 
of infected nodes by the end of the g-ih generation and m denotes the number of new infections that occured since the last 
generation. The degree distribution of the N — 1 000 nodes follows a power-law pk oc k~ T e~ k ^ K with r — 2 and k, = 5 and the 
probability of transmission along an edge is T = 0.8 . The phase-space representations, Eq. (12), are displayed, for generations 
2, 6, 11 and the final state. 



the removal of these nodes will significantly impact the 
connectivity of the network. 

To take this effect into account we define the generating 
function for the degree distribution of the remaining sus- 
ceptibles for the current size, s, of the outbreak/epidemic 



G s (x;s) = J2p S k(s)x k 



(16) 



The mean number of susceptibles of degree k is thus given 
by (N — s)pfr(s). However, the actual number of sus- 
ceptibles in a network characterized by Gq (x\ s) will in 
general be different from the mean value. Nonetheless, 
the difference becomes negligible in a sufficiently large 
population. In this limit, each (s) can be treated as a 
continuous function of its parameter s. The assumption 



of a large population is less restrictive than it appears at 
first glance and, by comparison with the results of nu- 
merical simulations presented at the end of this section 
(Fig. [3]), we can say that it holds for reasonably small 
populations (e.g. N = 1 000 for Fig. [3|. 

Since (s) must be normalized (5^ fe pf (s) = 1) and 
because the susceptibles of degree k have a probability 
k times greater of being newly infected than those of 
degree 1, we can derive a differential equation system for 
the evolution of (s) 



ds 



N • 



(17) 
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FIG. 2: (Color online) Projection p g s on the s axis of the 
(s,ra) phase space for the infinite-size network algorithm: s 
denotes the number of infected nodes by the end of the g-th 
generation and p 9 s denotes the probability for s to occur. The 
degree distribution and transmissibility of Fig. [I] are used 
and the generations shown (2, 6, 11 and final state) are also 
the same. The value of p 9 s is plotted against s (solid curves). 
The numerical results (dots), theoretical "infinite-size" out- 
break distribution (dashed curves) and theoretical "infinite- 
size" epidemic size (vertical dashed lines) are also displayed. 
Numerical results are obtained by creating an ensemble of 
10 3 equivalent graphs, each of which is used to run 10 5 sim- 
ulations, performing 10 8 epidemic simulations in total. The 
discrepancies are explained and corrected in Sec. |III| 



where the average degree is defined by 
sf(s) = 5>p£(s) . 



(18) 



In the present dynamics, the first infection targets a 
random susceptible node and thus, does not affect the 
degree distribution. Therefore, we use the initial condi- 
tion (1) = Pk (with pk being the degree distribution of 
the whole network) in Eq. (17) to get the solution 



p*«=p*S-4p«] fc 



' N ■ 



with 9(s) given by 
6(s) = exp 



ds' 



! (N-s')zf(s') 



(19) 



(20) 



The normalization of (s) leads to the convenient ex- 
pression 



]>> [0(s)] k = G o (0(s)) 



N- 



N ■ 



(21) 



For example, using the Poisson distribution 

p k = e~ z z k /k\ 



in Eq.(21 ) gives 



N-s 
N - 1 



k=0 



fc! 



e -z e z9(s) 



from which we can easily isolate 



6(s) = 



1 



z + ln 



N-s 
N — 1 



(23) 



(24) 



(25) 



It follows that the degree distribution of the susceptibles 
is also a Poisson distribution 



3fM 



[Z?(8)] k /k\ 



with an average degree given by 

'N-s 



In 



N — 1 



(26) 



(27) 



Inspection of this last expression reveals that zf(s) be- 
comes negative when TV — s < (N — l)e~ z . This limitation 
is due to the fact that the large population assumption is 
no longer respected: it implies the presence of (N — l)e~ z 
nodes of degree zero, but these nodes cannot be infected 
by the process leading to Eq. (17). Nevertheless, the 



probability of an epidemic reaching such high values of 
s typically vanishes in most realistic cases. Table [I] com- 
piles the analytic forms of 0(s) for some typical distri- 
butions. When a closed form satisfying Eq. ( [21] ) cannot 
be found, the quantity p^ (s) can nevertheless be derived 
numerically for each pair of k and s. 

Figure [3] shows the variation of the degre e distributions 
of susceptible nodes for 3 n etwo rks: |(a)| the power-law 
distribution used in Fig. [lj |(b)| a bino mial distribution 
with p = 6/N and N — 1 000; and (c) a bimodal distri- 
bution, in which the vast majority of nodes (95%) has an 
average degree of 6 and the rest has an average degree 
of 50. This latter network is particularly interesting as 
it may correspond to realistic settings such as hospitals, 
schools or shopping malls. Although slight deviations are 
caused by the underlying assumptions, there is very good 
agreement between numerical and analytical results. 

Once Gq (x; s) is known, we can show that the degree 
distribution of the susceptibles in the previous generation 
is given by Gq {x\ s — m). Again using mean- value consid- 
erations, we obtain the degree distribution, Gq(x; s,m), 
of those that became infectious in the last generation, i.e. 



mGo(x; 5, m) = [N — (s — m))Gg (x\ s — m) 

- (N-s)G s Q (x;s) 



(28) 



Moreover, Eq. ( 19 ) allows us to express Gq (x;s) in terms 
of the original Go[x) as 

Gi(x;s) = ?^G o (x0(8)) . (22) 



Gq(#; s, m) — 
(N-l) 



G (x9(s - m)) - G (x0{s)) 



■ (29) 
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TABLE I: Expression for 9(s) for some commonly used degree distributions. 



Degree distribution 



Expression for 0(s) 



Poisson 



e k 



9(a) 



z + \n 



N -s 
N - 1 



Binomial 



, N \ k/-i \N-k 
Pk = | u )P (1 -P) 



N -s 
N - 1 



l/JV 



+ p- 1 



Exponential 



Power law 



p fc = (1 - e 



Li r (e-V«) 



for fc > 1 



iV-l-(s-l)e' 



N-s 

N -s 



Li r ( e -^( s )) = ^ Lir ( c -i/«) 
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(a) Power-law distribution. 



N = 1000 
<fc)=6 
T = 0.2 




(b) Binomial distribution. 
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(c) Bimodal distribution. 



FIG. 3: (Color online) Time evolution of typical degree distri butio ns representing susceptible nodes. The e volu tion of the 
degree distribution of the susceptibles is shown f or 3 networks: (a) the power-law distribution used in Fig. [l] |(b)| a binomial 



distribution with p — 6/N and N = 1 000 ; and (c) a bimodal distribution, in which the vast majority of nodes ( 95% ) has 
an average degree of 6 and the rest has an average degree of 50. The analytical (curves) and numerical (crosses, 10 3 equivalent 
graphs with 10 3 simulations per graph) results are given for different outbreak/epidemic sizes s. The transmissibility values T 
were solely used to produce numerical results. 



The excess degree of the currently infectious nodes is 
therefore generated by 



G g (x;s,m) 



Go Or) 



(9 = 0) 
(5>1) 



(30) 



This distribution can then be used in Eq. (12) as a sub- 
stitute for G g (x) when finite-size effects cannot be ne- 
glected. 



B. Additional loss of transmissions 

For networks of finite size, it is no longer possible to 
completely neglect the effect of closed loops on the dy- 
namics of outbreaks. Indeed, it is possible that some of 
the neighbors of a newly infected node have previously 
been infected and, for dynamics where re-infection is im- 
possible, this implies fewer new infections than would 



have been predicted in an infinite network. Similarly, 
links between two infectious nodes or links from more 
than one infectious node to the same susceptible node 
also reduce the number of new infections. 

Furthermore, since the pair s f and m' completely char- 
acterizes the "state" of the system in an infinite network, 
it still carries a lot of information about the correspond- 
ing state in finite-size networks. We thus make the as- 
sumption that s' and m! are a sufficient basis to incorpo- 
rate the finite-size effects, i.e. the loss of transmissions in 
the finite network that would have occurred in an infinite 
one. 

Our main step is to seek a "mean field" approximation 
(where every parameter other than s 1 and m! is assumed 
to take its mean value) to the ratio 



(m) 
m 



(31) 



of mean number of transmissions rh that actually happen 
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in the finite network to the number of transmissions m 
that would have occurred in an infinite one. If each of 
the m infinite network transmissions is treated as having 
the independent probability 1 — p s >m' of being lost (a 
probability p s > m > of occurring) in the finite network, there 
is then a probability 



rn 



OwHi 



(32) 



that m transmissions occur in the finite network. Simi- 
larly, a subgroup I of the m infinite network transmissions 
will contribute I to the m finite network transmissions 
with probability 



l-l 



(33) 



Although these transmissions are not exactly indepen- 
dent events, the independence assumption is a good ap- 
proximation when the system has many degrees of free- 
dom, i.e., it holds when m is small compared to the num- 
ber of susceptibles of nonzero degree. The probability 
distributions for m (Eq. (32)) and I (Eq. (33)) therefore 
hold unless most nodes of the network have been infected, 



quite unlikely for example in realistic epidemiological ap- 
plications. One of the first effects of a non-negligible cor- 
relation would typically be a reduction of the variance of 
the distributions of m and I. 

Appendix [A] provides an expression for p s 'm' under as- 
sumptions of continuity (a differential equation approach 
similar to the previous section) for a sizable population, 
large enough to obtain meaningful mean values. All but 
one of these mean values are relatively easy to calculate, 
the remaining one is derived in Appendix [B] to complete 
the task. 



C. Phase-space representation in finite-size 
networks 



We now combine the degree distribution of infectious 
nodes 



G g -i(x]s',m') = y^pk(s f ,m f )a 



(34) 



obtained in Sec. IlII Al with the distribution of I to obtain 
the finite-size network counterpart of Eq. ([5| 



CO OO OO / 7 \ /7\ 

E E E^( s '' m ') [i ) Tl{ ~ 1 - T ) fe "' (f) (Ps'm'Yil - Ps>m>) l -~ l x l = G fl _i (1 + (* - l)T Pa , m ,; s' , rri) . (35) 



[=0 1=1 k=l 



This is now the generating function for the number of new 
infections caused by a single infectious node in a finite 
network when the state of the disease is characterized 
by s f and m! (at generation g). Defining an effective 
transmissibility T 3 t m t = T/v m /, the forward recurrence 
relation, Eq. (12), is then replaced by a new expression 



of the same structure 



*80*,y)= E tf~m> x *' 

s' ,m' 

x [(5 s _i (l + (xy - l)f s , m , ; s', m') 



, (36) 



where the evolution of the degree distribution is taken 
into account by the change G g -\{x) — > G g -i(x; s f , m') 
while the additional losses of transmissions are intro- 
duced by the replacement T — > T s / m /. 

Figure |4] depicts the phase- space representation for the 
finite-size power-law network, using the finite-size algo- 
rithm of Eq. ( [36] ) instead of the infinite size approach of 
Eq. (12). The system studied is identical with that of 
Fig. [1] and for comparison the infinite-size calculations 
are superimposed on the new results. Figure [5] presents 
the corresponding projections on the s axis. There is a 



major improvement in the agreement with the numer- 
ical simulations for all generations and the final state. 
Despite the approximations, and the small size of the 
network (N = 1 000), this agreement makes us confident 
that we have captured the major part of the finite-size 
effects. 

Furthermore, while the infinite-size formalism pro- 
duces a single number, S, for the giant component size 
(represented by the dashed vertical line in Fig. [5|, the 
finite-size formalism produces the whole probability dis- 
tribution of sizes above the epidemic threshold. More on 
this in the next section. 



IV. FUTHER RESULTS AND DISCUSSION 

In what follows, we examine in succession the results 
of the finite versus infinite formalism on and around the 
percolation threshold, make a comparison with recent 
dynamical models of disease propagation, establish the 
relationship with the reproductive number of epidemiol- 
ogy and briefly discuss possible avenues of improvements 
concerning clustering and continuous time evolution. 
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FIG. 4: (Color online) Representation of (s,m) phase space for the finite-size network algorithm: s denotes the number 
of infected nodes by the end of the g-th generation and m denotes the number of new infections that occurred since the last 
generation. The degree distribution, transmissibility and symbols are the same as for Fig. [TJexcept that the finite-size algorithm 
( |36| is used for the calculations. For comparison, the dashed curves are contour plots of the previous results for infinite-size 
network (Fig. [I] Eq. fl^). 



A. Behaviour around percolation threshold 



We recall first that the current paper has been strongly 
influenced by the previous works of p~2j [13] . Their for- 
malism allows for the calculation of the whole distribu- 
tion of outbreak/epidemic size and probability at the fi- 
nal state of the dynamics (g — > oo in the language of 
this paper) for infinite networks. It is based on the pgfs 
Ho(xY and H\(x) related to Go(x) and G\(x) of Eqs. 
and (I3| by the relations 



H 1 (x)=xG 1 (1- (l-ffi(:r))T) 
H (x) = xG (1- (l-H 1 (x))T) 



(37) 
(38) 



In particular, the pgf H (x) generates the distribution 
of size for outbreaks from which the average size of out- 
break/epidemic for infinite networks can be expressed as 



(5)00 = ff (lK(l) +N(1- H (1)Y 



(39) 



Note also that above threshold the fraction of nodes be- 
longing to the giant component is given by S = 1— Hq(1); 
below threshold Hq(1) = 1 as it should. 

Since Sec. [U is a direct extension of this formalism 
to discrete time evolution, the values it provides all con- 
verge to that of [13] at the final state of the dynamics. 
Formally, this is no surprise since as g — » 00, the recur- 
rence relationship ( fl2| ) approaches the self-consistency 
expressions (37) and ( [38] ) . In this asymptotic limit, 
Hq(x) = ^qJx, 1). Similar remarks hold for the work 
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FIG. 5: (Color online) Projection p 9 s on the s axis of the 
(s,m) phase space for the finite-size network algorithm: s 
denotes the number of infected nodes by the end of the g-th 
generation and p g s denotes the probability for s to occur. The 
degree distribution and transmissibility of Fig. [I] are used 
and the symbols used are those of Fig. [2] The results pro- 
duced by the finite-size algorithm are in very good agreement 
with the numerical simulations (10 3 equivalent graphs with 
10 5 simulations per graph) over the entire range of possible 
outbreak/epidemic sizes. 



of Marder 
evolution. 



another extension of [13] to discrete time 



By contrast, the formalism of Sec. Ill takes into ac- 



count finite-size effects. Instead of Eq. (|39|), the average 



size of an outbreak/epidemic is now provided by 



(40) 



since the distribution i/j^ is always properly normalized 
with pf = £ m and £,pf = 1. 

An important feature of the new formalism is its abil- 
ity to capture the finite size of an outbreak in the vicinity 
of the critical transmissibility, T c , which separates small 
outbreak and large-scale epidemic zones. This notion is 
depicted in Fig. [6] where we consider the same power- 
law network for different transmissibilities T (T c ~ 0.6). 
In an infinite-size network, we observe a "divergence" 
in the outbreak/epidemic size, (s)oo? just around the 
transmissibility threshold. However, numerical simula- 
tions on finite-size networks (and similarly real-life out- 
breaks) never exhibit such divergence. Figure [6] demon- 
strates that the finite-size formalism can accurately re- 
trieve the size of outbreaks on and around the trans- 
missibility threshold. Away from threshold, infinite and 
finite methods agree with the simulations, but only the 
finite-size approach covers smoothly the complete range 
of transmissibilities without unrealistic divergence. 



120 



100 




FIG. 6: (Color online) Average size of outbreaks and/or epi- 
demics for the power-law network of Fig. [I] (N = 1 000) as 
a function of transmissibility, T. The infinite-size formalism 
provides the position of critical transmissibility, T c c± 0.6080, 
above which epidemics can occur (vertical dotted line), as 
well as the expected size of outbreaks or epidemics (s)oo (Eq. 

(39) , dashed curve). In contrast to the divergence at T c in 
the infinite-size formalism, numerical simulations (circles, 10 3 
equivalent graphs with 10 3 simulations per graph for 100 dif- 
ferent values of T) show a smooth monotonic increase of (s). 
The prediction of the new finite-size algorithm ((s)jv 3 Eq. 

(40) , solid line) is in perfect agreement with numerical re- 
sults. Four other curves provide the results of the models 
presented in Sec |IVB[ 



B. Importance of the variety of outcomes 

It would be informative to compare our method with 
some other propagation approaches on a structured net- 
work. The chosen models all perform some "mean field" 
approximations that prevent them from taking into ac- 
count the wide variety of possible outcomes, which can 
span the whole range from small outbreaks to large-scale 
epidemics. In effect, these models can only provide mean 
values and the quality of the resulting averages is some- 
times difficult to estimate. 

For the purpose of comparison, all the models are de- 
fined for the same network ensemble as the one presented 
in Sec. [IT] and the quantity that we will focus on is the av- 
erage size of outbreaks/epidemics at the asymptotic final 
state of the dynamics. 



1. Volz (2008): A network- centric approach 

Close in spirit with the present contribution, but tech- 
nically very different, is the recent approach derived by 
Volz [22 . The formalism is based on network-centric 
quantities (such as edges linking susceptible nodes to in- 
fectious nodes or to other susceptible nodes) in order to 
introduce time evolution and finite-size effects. Although 
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it is also based on the pgf Go(x) (Eq. Q), it uses a set of 
differential equations to track the evolution of mean val- 
ues. The network-centric quantities are later converted 
to node-centric quantities, such as the number of infected 
or susceptible nodes. The asymptotic (t — > oo) average 
size of an outbreak/epidemic we are interested in can be 
cast in the following form 



(s)voiz = N (1- Go (M«0)) 



(41) 



where Oy and the full set of differential equations are de- 
fined in Appendix CI It is interesting to note that Qyjt) 
is conceptually related to its counterpart 0(s) (Eq. (20)) 
of the finite-size algorithm, although they are obtained in 
a completely different manner. Figure [6] displays (s)voiz 
as a function of T and shows a remarkable improvement 
over the infinite size results. The remaining discrepancy 
with the simulations and (s) n can be attributed to the 
mean field approximation used where only average values 
are evolved in time. 



2. Moreno et al. (2004)-' A compartmental approach 

A special class of models for the dynamical propagation 
on networks is the use of compartmental models where 
nodes of different degree are placed into different com- 
partments [T8l [T9l l20]. We have integrated the SIR-like 
model of [20 , whose description is detailed in Appendix 
|C 2[ to construct the asymptotic average for a population 
N 



(s) c = N^R k (cx)) 



(42) 



of infectious nodes, by one, each time they cause new in- 
fections. Thirdly, the same procedure must be done for 
infectious nodes linked to infectious or removed nodes 
(and applied to these infectious or removed nodes, as 
well). To put it differently, the knowledge that no infec- 
tion occurs carries as much information as the knowledge 
that an infection occurs and both of these scenarios must 
be taken into account when modeling the dynamics of the 
system. 

We have implemented these improvements in the orig- 
inal differential equations (see Appendix |C 3| ) and the 
asymptotic result, say (s)*, is plotted in Fig. [6] The 
agreement is much better and essentially reproduces the 
result of Volz, (s)voiz- It seems that our modifications 
have nicely included the better part of the correct dy- 
namics. 

In contrast to the finite-size formalism that takes into 
account the full stochastic nature of the process, these 
three systems assumes that the "average evolution" oc- 
curs at each time. Such an approach seems to be 
more suitable for an endemic disease in a susceptible- 
infectious-susceptible (SIS) dynamics since the stochas- 
ticity is averaged when many nodes are infectious at the 
same time. When only a small amount of infectious nodes 
are present at some point in the evolution of the disease 
(e.g. the very beginning), the stochastic nature of the dy- 
namics can have tremendous impacts on the long-term 
behavior of the system. In particular, it is those very 
stochastic effects that allow for small components even 
beyond the epidemic threshold. Systems solely tracking 
averages cannot take into account this important behav- 
ior. 



where R k is the fraction of nodes that are recovered and 
of degree k. A slight correcti on to this model has been 
proposed in [31] and Appendix C 2 explains how to obtain 
the corresponding asymptotic average {s)^ v . The results 
seen in Fig. [6] show large deviations of both models long 
before the critical T c for the network and appears valid 
only for small transmissibility. The source of failure in 
reproducing the simulations is quite subtle and has led 
us to propose an improved version. 



3. An improved compartmental approach 



Although the system \Cb\ appears to be a natural 
choice, it neglects some important points. Firstly, all 
infectious nodes (except the first one) become infected 
from one of their neighbors. Since a link between two 
nodes will always remain attached to these two nodes, 
the "effective" degree of an infectious node is (at least) 
one less than its actual degree. We introduce this in the 
model by transferring nodes from S k to I k -i (instead of 
Ik). Secondly, an infectious node that infects a previ- 
ously susceptible node along a given link cannot infect 
again along that same link; we should reduce the degree 



C. Effective reproduction number 



In order to establish a link between the present for- 
malism and the classical epidemiological models, it is 
worth revisiting the interpretation of the basic reproduc- 
tive number — a key parameter in classical epidemiology 
[TJ [5] . We can derive the mean excess degree of the in- 
fectious nodes as , z sm = G^_i(l;s,m), and together 
with the effective transmissibility T sm , obtain the corre- 
sponding effective reproduction number R sm — T sm z sm 
for each (s,m) state. Figure [7] shows the dependency of 
Zsm, T srm and R sm on s and m for the networks intro- 
duced in Fig. [3| The behaviour of the power law dis- 
tribution is dominated by the variability of z smi as T sm 
remains relatively uniform in the vicinity of the epidemic 
threshold. However, the converse is true for the binomial 
distribution; it is the variability of T sm that is respon- 
sible for the behavior of this distribution. The bimodal 
distribution is seen as a mixture of these two behaviors. 
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FIG. 7: (Color online) Effective reproduction number interpretation. For the 3 networks presented in Fig. [3] we show the 
expected excess degree of the infectious z srn , the effective transmissibility T sm and the corresponding effective reproduction 
number R sm for each (s,ra) state. Th left, center, and right columns display the results of the power-law, the binomial, and 



the bimodal distributions respectively. The solid black line in the R s 

Rsm — 1 • 



L sm^sm 



plots corresponds to the "threshold" value 



D. Clustering and continuous time evolution 



Although most real-world contact networks show a 
clustered structure, and ways to incorporate this prop- 
erty into the dynamics are rapidly developing [26, 27, 28, 
[29] . our model does not at present explicitly account for 
clustering. In clustered networks, two neighbors of the 
same node i are more likely to be neighbors of one another 
than of any other random node. In fact these "triangles" 
(or short loops) are not forbidden in the network ensem- 
ble introduced in Sec. [TTJ and over which our formalism is 
defined, but their number, however, is ruled by random- 
ness alone and is less than what would be expected in a 
typical realistic human population for instance. Hence, 
since it has recently been shown that clustering tends to 
decrease the number of infections in epidemic dynamics 
[29] , our model represents a worst case scenario for net- 
works whose clustering properties are stronger than their 



random values. One promising way to improve upon the 
present framework, while keeping the generality of a pgf- 
based formalism and allowing for better treatment of the 
evolution on clustered networks, would be to generalize 
the model to a bipartite graph structure where nodes (one 
part of the network) are assigned to different groups (the 
other part of the network) much in the same spirit as 
originally studied in [30] . 

Another aspect that needs further development is the 
relationship between discrete (generational) and contin- 
uous time evolution. For systems where the underlying 
epidemiological dynamics is a discrete process in time 
(e.g., there is a constant time interval r between suc- 
cessive infections), the generational formalism developed 
in Sec. [TT] directly represent the evolution of the system 
over time (infection of the g-th generation occurs at time 
gr). However, in most natural situations, the underlying 
dynamics is a continuous process. We are actively pur- 
suing the issue of a general continuous epidemiological 
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dynamics; the analysis will be the subject of a forthcom- 
ing contribution. 



APPENDIX A: THE RATIO p s , m , = {(m)/m) , , 
(EQ. 31) 



V. CONCLUSION 

The emergence and re-emergence of infectious diseases 
pose a great threat to public health. The potential spread 
of a new pandemic strain of influenza or other emerging 
infection, such as SARS, may have a devastating impact 
on human lives and economies. There is an urgent need 
to develop reliable quantitative tools that can be used to 
compare the impact of various intervention strategies in 
real time. These tools must be able to incorporate the 
detailed structure of contact networks responsible for dis- 
ease spread, as well as compare various intervention out- 
comes during the time of crisis, in a relatively short time 
span. In addition, these tools should be as equally ap- 
plicable to large-scale networks as to finite-size networks, 
seeing that many interventions must be implemented not 
only globally, but locally (e.g., hospital settings, schools) 
as well. 

In this paper, we have introduced and validated a the- 
oretical framework that enables us to incorporate these 
two important aspects of disease outbreaks/epidemics, 
simultaneously. Specifically an extension to the existing 
formalism has been derived while keeping its appealing 
structure in terms of generating functions. With the in- 
troduction of the concept of generations and phase-space 
representation, Eq. (12) — for the infinite-size network 



with fixed transmissibility T — has been replaced by Eq. 
( [36] ) to account for finite-size effects through a modi- 
fied generating function and an effective transmissibil- 
ity. This is our finite-size, discrete time algorithm. One 
of its important features is its ability to follow the com- 
plete diversity of outcomes, not only averages, from small 
clusters (outbreaks) to the giant component (epidemic) 
within each generation. A complete formalism includ- 
ing both finite-size and continuous time with or without 
correlations between transmissions is still lacking and its 
derivation is part of ongoing research. 
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It is important to take into account that the number 
fa of new infections that actually occur in a finite net- 
work will typically be smaller than the number m of new 
infections that would have happened in an infinite net- 
work. As presented in Sec. |IIIB[ the m links leaving 
infectious nodes and causing new infections in the infi- 
nite network can lead to either susceptible nodes, other 
infectious nodes or recovered nodes. Of these links, only 
those leading to susceptibles nodes can actually result in 
new infections in the finite network and, moreover, a sus- 
ceptible node targeted by more than one of these links 
contributes to only one transmission. 

We note by ns, ni and ur the number of links that 
are not forbidden to join infectious nodes to suscepti- 
ble nodes, other infectious nodes and recovered nodes 
respectively. Since there is no special restriction forbid- 
ding susceptible nodes to be linked to infectious ones, 
ns is simply the sum of the degrees of all susceptibles. 
In the same way, ni is the sum of the excess degree of 
the infectious nodes (since the sole restriction is that, 
for generations other than zero, there is at least one link 
from each infectious nodes to a recovered one). When 
the state of the infection is characterized by s' and m', 
these considerations translate to the mean values 



(n s ) s , m , = (N- S ')G$'(l;s') 
(ni)s'm' =m'G' g _ 1 (l;s',m') 
{m) s 'm' = T(ni) s > m > , 



(Al) 
(A2) 
(A3) 



where G^(x;s f ) and G g -i(x; s', m') are defined in Eq. 
(22) and Eq. (30) respectively. More complicated con- 
straints apply to tir and its mean value (nR) s > m > is ob- 
tained separately in Appendix |B| 

We develop two methods to evaluate p s >m' — 
((ra)/ra) , the ratio of the mean number of new in- 
fections occurring in the finite network to the number 
of new infections that would have occurred in an infinite 
one. Both methods are based on the assumption that the 
m links leading to infections in the infinite limit have the 
same a priori probability of targeting any of the n^, nj 
and ur targets, which is justified when ns + nj + nR is 
large compared to the degree of the node of highest de- 
gree in the network and is typically satisfied unless most 
of the network has been infected. 

The first of these methods is quick and simple. The sec- 
ond method is based on a differential equation approach 
similar to the one used in Sec. MI Al As both methods 
bring the same result, this second approach is an addi- 
tional justification of the differential equation method for 
cases where an alternative approach is not known, as in 
Appendix [B] for instance. 
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1. Direct approach 

Since the m links causing infections in the infinite net- 
work can lead to ns + nj + ur potential targets in the 
finite one, each link belonging to a susceptible node {i.e. 
one of the ns targets) has a probability 



A 



ns + nj + n R 



(A4) 



to be one of these links. If that node is of degree fc, 
there is a probability 1 — (1 — X) k that at least one of 
the m links leads to it, causing a new infection. Using 
the degree distribution of the susceptibles obtained in 
Sec. Ill A and the fact that N — s' susceptibles are left 
when the state of the infection is characterized by s' and 
m', the expected number of new infections in the finite 
network is then 



(m) = (N-s')J2pUs') [l"(l-A) fc 

k 

= (N-s')[l-G s (l-\;s')] . 

Therefore, with the use of Eqs. (|Al}|A5|, 
Ps'm' = {{fh)/m) s is provided by 



1 (N-s') [1-Gff(l 
T 



A.s'r 



rrv 



where 



and (n R ) s 



(m) s 



(A5) 
the ratio 

(A6) 
(A7) 



(ns)s'm' + (ni) s'm' \ y^R/s'r. 
is given in Appendix |B| 

2. Continuous approach 



In this approach, we relax the numerous constraints 
of the problem by changing the task to an "assignment 
process" evolving in time. By analogy, this process is 
similar to the system children use for their casual sports 
team when two leaders choose their team members one 
by one. It should be clear that the time dependency 
introduced in this manner is an artificial construct and 
that only the result at t = oo ("assignments of teams is 
completed") is relevant. 

We define a(t) as the pool of potential infections that 
occur in the infinite network and for which a target has 
yet to be assigned at time t. This definition implies the 
initial condition a(0) = m; we must also have a(oo) = 
(the process must assign a target to each potential in- 
fection). We also define /?(£), the pool of targets for 
which assignment of one of the potential infections does 
not lead to a new infection, yielding the initial condition 
/3(0) = n R + (nj — m). We finally define 7fc(t), the pool 
of targets belonging to susceptible nodes of degree k and 
to whom the assignment of a potential infection leads to 



an actual new infection in the finite network. From mean 
value considerations, 

~f k (0) = (N - s') kp s k (s') (A8) 

witn Z)fc7fc(0) = n S - 

Since the time dependence is arbitrary, we assign tar- 
gets to the potential infections at a rate proportional to 
their current population a(t), which is a simple way to 
obtain a(oo) = 0. With the definition 



;(t) = a(t) + /3(t) + $37 fc (t) 



(A9) 



the probability of assigning a potential infection to other 
potential infections, other targets not leading to infec- 
tions, or to targets leading to infections belonging to 
degree k is a(t)/u(t), f3{t)/uo(t) and 7/ c (t)/o;(£), respec- 
tively. Moreover, upon assignment of a potential infec- 
tion to a target belonging to Jk(t) : we transfer the re- 
maining k — 1 targets to f3{t) to ensure that later as- 
signment to these targets will not lead to new infections. 
These considerations translate into a set of coupled non- 
linear differential equations: 



da(t) 

dt 
d(3(t) 



-a{t) 



a(t) 



d ^=-a(t)^k 



dt 



for all k. By summing these terms, we note that 
duj(t)/dt — —2a(i), together with da(t)/dt, yields the 
solution 



A 



a(t) 

cj^j ~ A + (1-A)e* 



(All) 



where A = a(0)/o;(0) = nr/ins + nj + tir). This result 
allows us to rewrite the equation governing jk(t) as 



d'jkjt) 
dt 



l + (A" 1 -l)e t 



(A12) 



which is completely decoupled from the rest of the sys- 
tem. This equation has the solution 



7fc(*)=7fc(0) (Ae-' + l-A)' 
and produces the limit 

7fe(oo) = 7fc(0) (1 - A) fc 



(A13) 



(A14) 



The number of nodes of degree k that are infected in the 
process is thus 



7fc(0) ~ 7fc(oo) _ 7fc(0) 
k k 
= (N- 



(1-(1-A) fe ) 
*>£(/)(!-(! 



A)* 



(A15) 
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Using the initial conditions, we obtain the average total 
number of infections in the finite network 

(to) = (N- s') ($>£(*') ~ £pf (*')(! " A) fe ) 

V k k / 



(N-s') [1-G S (1-A; S ')] 



(A16) 



Notice that this expression is the sam e as Eq. (A5) and 
thus leads to the same /v m /, i.e. Eq. (A6). 



APPENDIX B: THE AVERAGE VALUE (nj*) s / m / 

While it is quite easy to obtain mean values or even 
distributions for n/, ns and ra, an independent method 
is required to evaluate the quantity ur. Here, we use a 
continuous approach similar to that of Appendix [A) 

Again, we design a differential equation governing the 
evolution of the continuous counterpart of n^, denoted 
77(5), as a function of the number of infections, s. If 
the k — 1 excess degrees belonging to a newly infected 
node, of degree /c, are linked to susceptible nodes, we 
expect a fraction 1 — T of them to be unsuccessful in 
transmitting the infection to others and thus, contribute 
to r](s). However, we expect a fraction, r](s)/ (ns(s)) , 
of the k — 1 excess degrees to be linking to recovered 
nodes, with (ns(s)) = (N — s)zf (s). These last links 
actually reduce the value of 77 (s), because links between 
susceptible and recovered nodes are converted to links 
between recovered and infectious nodes. Defining 



zl(s) = J2k(k-l)p s k (s) 



G s n " 



(Bl) 



the differential equation considering these effects reads 



drj(s) 
ds 



\ - kpf(s) 



Zf(8) 



(k-1) 

(ns(s)) 
(1 - T) - (2 



(1-T) 



r?(s) 



(ns(s)) 



T) 



(ns(s)) 



(B2) 



Together with the initial condition 77(1) = (1 — T)zf (1) = 
(1 — T)z±, this differential equation can be integrated nu- 
merically to provide an expectation value for the number 
of links between recovered and susceptible nodes when 
the size of the outbreak/epidemic is s. 

The quantity (nR) 3 r m r, required to fix A s / m / in Ap- 
pendix |Aj is the mean number of links emerging from re- 
covered nodes that are allowed to target infectious nodes 
when we know the current state of the epidemic is char- 
acterized by s f and ra'. As ra/ is the current number of 
infectious nodes, the size of the outbreak/epidemic was 
s' — ra/ when these infectious nodes were still suscepti- 
ble nodes (in the previous generation). At that time, 
we expect r](s' — ra/) links to join susceptible and recov- 
ered nodes; this number is equal to the expectation value 

{nRjs'm'' 



APPENDIX C: SOME DYNAMICAL MODELS 

The following models are all defined for the same net- 
work ensemble as the one presented in Sec. [Hj i.e. their 
degree distribution {pk} is generated by Gq(x). 



1. Volz (2008): A network-centric approach 

At time t, let 6y(t) be the fraction of degree one nodes 
that remain susceptible, pi(t), the probability that a sus- 
ceptible node be connected to an infectious one and ps(t), 
the probability that a susceptible node be connected to 
a susceptible one, then 



dOy(t) 

dt 
dpi(t) 
dt 



dpsjt) 
dt 



= -rpi(t)O v (t) 
= rps(t)p I (t)6 v (t) 



G>>(e v {t)) 



G' o (0v(t)) 
■rpi(t)(l- Pl (t))-iA Pl (t) 

G'{(O v (t)Y 



, (CI) 



= rp s (t) Pl (t) l-O v (t) 



G f {O v (t)) 



where r is the force of infection, the constant rate at 
which infectious nodes infect a neighbor and \i the re- 
covery rate at which infected nodes become recovered. 
For the purpose of calculations, we have chosen fi = 1 
and a related transmissibiliy T = 1 — e~ r [13 . The ini- 
tial conditions have been set to 0y(O) = 1 — e, pi(0) = 
e/(l — e) and ps(0) = (1 — 2e)/(l — e) for e satisfying 
Gq(1 — e) = 1 — 1/N. The asymptotic (t — > 00) average 
size of outbreak and/or epidemic is then given by 

<*)voiz = JV(l-G (M°°))) . (C2) 



2. Moreno et al. (2004): A compart mental 
approach 

At time t, let Skit) be the fraction of nodes that are 
susceptible (Ik(t) and Rk(t) for the infectious and the 
removed nodes respectively) and of degree k subject to 
the normalization 



^2(S k (t) + I k (t) + R k (t)) =1 



(C3) 



With the probability that any given link pointing to 
an infected node is 



$>A(t) 



e(t) 



^2k'(S k >(t) + I k ,(t) + Rk,(t)) 



(C4) 



and r defined as in Sec. |C 1[ the dynamics obeys the 
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differential equations 
dS k (t) 



dt 
dh(t) 

dt 
dR k (t) 

dt 



-rO(t)kS k (t) 
-I k (t) + re(t)kS k (t) 
h{t) 



(C5) 



Solving this system for the initial conditions 5^(0) = 
(1 - 1/N)p k , h(0) = Pk/N and R k (0) = leads to the 
asymptotic average size of outbreak and/or epidemic 



<s) c = 7V^i4(oo) 



(C6) 



8*( t) is the the corresponding expression for Q(t) (Eq. 
|C4| ) and the integration is carried out under the same ini- 
tial conditions. The asymptotic average size of outbreak 
and/or epidemic is then also 



Note that a slight correction to this model has been 
proposed in [31] in order to take into account that in- 
fectious nodes acquired their infection along one of their 
links and that new transmission is impossible along that 
link. The corresponding asymptotic average size of out- 
break and/or epidemic (s)^ v is obtained in the same way 
as for the compartmental approach except that the factor 
k in Eq. (|C4[) has to be replaced by k — 1. 



(s)* c = nJ2ru™) 



(C8) 



3. An improved compartmental approach 



Incorporating in Eq. ( C5 ) the improvements described 
in Sec |IV| results in a new differential system 



dsm 

dt 

dim 

dt 



dR* k (t) 
dt 



-rQ*(t)kS k (t) 

-P k (t)+re*(t)(k + l)S* k+1 (t) 
+ r(G*(t) + l)[(k + l)l* k+1 (t)-kim 
im +rO*(t) Uk + l)R* k+1 (t) - kR* k (t) 



(C7) 



Both infinite differential systems (C5) and (C7) are trun- 
cated at /c max = 50 in numerical simulations. 
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